Robust regression with compositional covariates
نویسندگان
چکیده
Many biological high-throughput datasets, such as targeted amplicon-based and metagenomic sequencing data, are compositional. A common exploratory data analysis task is to infer robust statistical associations between high-dimensional microbial compositions habitat- or host-related covariates. To address this, a general regression framework RobRegCC (Robust Regression with Compositional Covariates) proposed, which extends the linear log-contrast model by mean shift formulation for capturing outliers. includes sparsity-promoting convex non-convex penalties parsimonious estimation, data-driven initialization procedure, novel cross-validation selection scheme. The procedure implemented in R package robregcc. Extensive simulation studies show RobRegCC's ability perform simultaneous sparse outlier detection over wide range of settings. demonstrate seamless applicability workflow real gut microbiome dataset from HIV patients analyzed set species host immune response soluble CD14 measurements inferred.
منابع مشابه
Regression Discontinuity Design with Covariates
Regression Discontinuity Design with Covariates In this paper, the regression discontinuity design (RDD) is generalized to account for differences in observed covariates X in a fully nonparametric way. It is shown that the treatment effect can be estimated at the rate for one-dimensional nonparametric regression irrespective of the dimension of X. It thus extends the analysis of Hahn, Todd, and...
متن کاملSemiparametric Quantile Regression with High-dimensional Covariates.
This paper is concerned with quantile regression for a semiparametric regression model, in which both the conditional mean and conditional variance function of the response given the covariates admit a single-index structure. This semiparametric regression model enables us to reduce the dimension of the covariates and simultaneously retains the flexibility of nonparametric regression. Under mil...
متن کاملMisclassification in Logistic Regression with Discrete Covariates
We study the effect of misclassification of a binary covariate on the parameters of a logistic regression model. In particular we consider 2 2 2 tables. We assume that a binary covariate is subject to misclassification that may depend on the observed outcome. This type of misclassification is known as (outcome dependent) differential misclassification. We examine the resulting asymptotic bias o...
متن کاملRandom Partition Models with Regression on Covariates.
Many recent applications of nonparametric Bayesian inference use random partition models, i.e. probability models for clustering a set of experimental units. We review the popular basic constructions. We then focus on an interesting extension of such models. In many applications covariates are available that could be used to a priori inform the clustering. This leads to random clustering models...
متن کاملNonparametric Regression with Nonparametrically Generated Covariates
In this paper, we analyze the properties of nonparametric estimators of a regression function when some covariates are not directly observed, but have only been estimated by some nonparametric procedure. We provide general results that can be used to establish rates of consistency or asymptotic normality in numerous econometric applications, including nonparametric estimation of simultaneous eq...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Computational Statistics & Data Analysis
سال: 2022
ISSN: ['0167-9473', '1872-7352']
DOI: https://doi.org/10.1016/j.csda.2021.107315